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Abstract 

Protensoiiies are large, multisubunit proteases wuh highly consented structures. Tlie 2<5S proteasome of eukaryolcs 
is an ATP-dcpcndcnt enzyme of abont 2 MDa. which acts as the central pu)leasc of the ubiquitia-depcndent pathway 
of protein degradation. The core oflhc 26S complex is formed by the 20S proteasome, an ATP-indcpcndcni, barrel- 
shaped protease of about 700 kDa, which has also been detected in archacbacieria and, inore recently, in eubacteria. 
CufTcnily, the distribution of 20S proteasomes in eubacteria appears limited to the aclinomycetes, while most other 
eubacteria contain a related ctimplex of simpler structure. 



Introduction 

Hie 26S proteasome is a large ATP-dcpcndcnt proteo- 
lytic complex found in the cytosol and nucleus of all 
eukaryotic cells, li is an abundant protein^ which rep- 
resent ilie core enzyme of the rkin-lysosomal path of 
prolcin degradation and tiius plays a key role in many 
cellular processes (reviewed in [1-31). 

The 26S complex consists of two asymmetric 19S 
caps flanking a barrel-shaped 20S core (die 20S pro- 
teasome) [4]. The I9S caps contain the ATPasc func- 
tion and select the tai gets for degradation, which they 
present in unfolded form to the 20S core. The caps con- 
sist of approximately 1 5 different subunils between 25 
and I (0 kDa, six of which are ATPases t>elonging to 
the AAA family [5-8]. 

Tl^ 20S core degrades the substrates to peptides 
of approximately 4-10 residues length [9, 10]. It con- 
sists of 14 different but related subunits of about 22- 
35 kDa, which can be divided into an o:-iypc and a 
/?-iypc group on the basis of their sequence similar- 
ity (Figure I) f 1 1, 12]- The subunils arc arranged into 
four seven-membered rings (13, 14], with the a-type 
subunits forming the outer and the .5-lype subunits the 
inner rings of the complex [15-18]. Collectively, die 



four rings form an elongated cylinder of 15 nra length 
and \ 1 nm diameter [14, 19-21], which is traversed 
from end to end by a channel and contains three large 
inner cavities bounded by four narrow constrictions at 
the level of the a- and j^-subunit rings (14, 22], Sub- 
strates penetrate into flie proteasome via the central 
channel; as expected from the dimensions of the o- 
ring constrictions, only entirely unfolded polypeptide 
chains can pass [23]. By a still unknown mechanism, 
substrates wind their way through the outer cavities to 
the proteolytic active-site clefts in the central cavity. 

As anticipated from their sequence similarity, a- 
and y?-type ^subunits have the same fold [14]: a four- 
layer a + 5 stnicture with two central antiparalicl ^- 
.sheets, flanked on either side by a-helices. The /3-sheei 
sandwich is open at one end to form the active-site 
cleft. The main difference between a- and ^typc sub- 
units lies in a highly conserved N-terminal extension 
of a-subunits, which fonns an tv-helix across the lop 
of the ccnira] /9-sandwich, thus filling the active-site 
cleft Correspondingly, only the /J-iypc subunits have 
proteolytic activity. The function of the N-terminal 
extension is not clear, but its location at the top of the 
a-rings close to the entrance of the cliannci indicates 
that it may be important for interactions between tlie 
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ngttre L Dendrogram of 20S proteasonic subunHs. sbowku; Uieir 
classification into o-typc and i?-lypc sctpciiocs. Archaebacterud 
branches arc dotted «kJ CttbacCei-ial hraiicliu are bold. The orgun> 
isnis arc: Hs, Hatuo sapiens; Ta. Hiennopiamut ocidophUum., Rc 
RhfMlotrticcus €tythmpolix\ Ml, MycQbactenum Icf^c: Ec. Escheri 
chia cvlh Bb. iktciltus suhtUis\ Ph, Pasttturflia hannotwca\ Ae« 
Akaiigfnes eutrophm. The division into actKr and inactive i3Ay\yc 
subunit:> was deduced from tlic conservation pattern of active-site 
residues (3 1] aiid from ihc locaiicm of t(K piTiccssing site (Fjgutr 2). 
For the three branches of activit human j5-typc subunits, the ^ 
intcrfcfun iothicJbie alternates are also shown. 



proteasome and its regulatory complexes, as well as 
for guidance of (he unfolded substrates dirough riie 
channel In place of the N-ierniinal ci-helix, /5-type 
subunits contain a prosequence (Figure 2) f 1 6, 24-28], 
which is cleaved off during proteasome assembly to 
free the active site [29. 30J. The cleavage occurs on the 
N-terniinal side of a threunine residue, which may act 
as both the catalytic nucleophile and die primary pro- 
ton acceptor in iJie mature subunit f ] 4, 3 1-33]. Beside 
the N-terminal threonine, p subunits require two fur- 
ther residues for activity, whose exact role remains to 
be clariOed: a lysine residue und u negatively charged 
residue, which form a salt bridge across the bottom of 
the active site and may participate in the dclocalization 
of the threonine side-chain proton by forming a charge 
relay system [14, 3 1 ]. 

26S and 2US prolcastnnes are ubiquitous in euka- 
ryotcs. 20S (but not 26S) protea somes have also been 
discovered in the archaebacteria ThermopUunna acido- 
phUum 121], Methanosarvwa tJtennophila [34J, and 
(by genomic sequencing) in Methanococciisjansiaschii 
(35 J. I'hese are composed of only one a and one 0 sub- 



unit and thus have a simpler stniciun: than eukaryot- 
ic proteasomes, but arc otherwise indistinguishable in 
their quaternary stnicture at low resolution (1.5 nm). 
Recently, 20S proteasomes (28] as well as simpler 
coniplexes corresponding to the two central protea- 
some rings f36J have been discovered in eubacteria. 



Pmteasome sequences in eubacteria 

The first report of multicuialytic cubacterial proteases 
possibly related in the proteasome u'as made from the 
actinomycete Frankia stiain BR (371. Several pn>- 
teolytic complexes between 270 kDa and 1.3 MDa 
were identified in crude extntcts, but the low amount 
of material, which was due to the extremely slow 
growth of the organism, precluded the purification and 
sequence determination of these complexes. 

The similarity between a eubacteria! sequence and a 
proteasome subunit was first noted during the Escheri- 
chia coll gcntime project for the heat-shock locus V 
gene {hsiV) (38]. A search of sequence dalaha.sc.s 
tevealed the existence of several cuhacteriaJ proteins 
related to proteasome j9-type subunits 139|, These fell 
into two groups (Figure I): the sequences from R. coli, 
Pasteurella haenwfyn'ca^ Bacillus subrdis and Alcali- 
^enes eiutvphus, which are all HslV homologues and 
contain eitlier no prosequence or only a short one, and 
the sequence from the actinomycete Mycobacterium 
leprae^ which resembles eukaryotic and archaehacceri- 
al subunits more closely and contains a prosequence 
of 56 residues (Figure 2). ITie HslV homologues (but 
not the M. leprae sequence) were found to occur in an 
openm with an ATPase of the CIp family (Figure 3), 
suggesting that they fonn a hybrid structure combining 
features of the Clp protease with those of the protea- 
some. In contrast, the sequence of M. leprae^ which has 
greater similarity to eukaryoiic sequences and coniains 
a large prosequence, was proposed to form a structure 
of the 20S type. 

Independently, an operon with striking similarity 
to the Af leprae proteasome gene region was found 
during the sequencing of the the gene cluster in the 
soil actinomycete Rhodococcus eryHiropolUi NI86/21 
(28. 40]. The similarity included a sequence identity of 
61-81% on the protein level, and, on the DNA level, 
a iranslational coupling between llie three genes in the 
operon. In addition to the proteasome /5-type sequence, 
the opcrim was proposed to also contain a liighly diver- 
gent a-type sequence, supporting die hypothesis that 
actiiiomycetes contain a proteasome of the 20S type. 
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Figure S. Operon structure of actincmiyoetal 20S subunits (A), and of the HsIVU protease (B). ^-lyiie Rubunlt genes (prxrtf A pn^2, pfcB, 
hsiV), o-type subunit genes ^rcAL prtA2. €3-260^, and ATPasc genes (AAA ATPase, C/-/67. hstlf) arc labeled io boldface. The the genes, 
whose cluster extends bcycmd the rcgkm $hown here, eiicude pniteins rcquinrd fur ihiocarfoamate dcgradarkM [40]. RLEP is a oo^iy of the 
M. lepraC'S(foc\iic dispersed repetitive j;cqucnce. 



Because M, leprae is a is an obligateiy intracellular 
pathogen that cannot be cultivated in vitm, the sub- 
sequent effort to isolate a eubacterial proteasoine was 
concentrated on Rhodococcus. 



The pruteasome of Rhodococcus erytkropolis 

Isolation and hiodmrnical cliaracteriwion 

The proteolytic activity of 20S proteasomesis resistant 
to low amounts of SDS. This property was exploited 
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in order to idcnlify the tniclion of Rkodococa4s celt 
extract containing the 20S proreasonne [28]. The conn- 
plex was then purified to more than 95% homogeneity 
by four successive cJironiatography steps. The sub- 
strate specilicity of the particle as assayed with fluori- 
genie peptides was similar to that of the archaebacterial 
proteasomefrom ThennopUisma, Jbligh levels of activ- 
ity were mcttsurcd t<)r most chymotryptic substrates, 
whereas tryptic and petidyl -glutamyl substrates were 
not measurably degraded. It should be noted though 
that this spcciiicity has little (if any) relevance for the 
actual activity against protein substrates and thus for 
the in vivo situation, as was shown for the proteasomes 
from Tliemoplaxma and man [9, lOJ, It is clear from 
the ccystal structure of the Thermoplasma proteasomc 
diat a classification of specificities based purely on the 
residue at the PI position must fall ,short of reality, 
given the multiple interactions the proteasome makes 
witlt residues up to position P4, and possibly beyond 
114]. 

SDS-polyacrylamide gel electrophoresis of the 
Rhffdw:fjccux proteasomc surprisingly yielded four 
different bands of similar intensity, rather than the 
two expected from the operon sequence [28]. Protein 
sequence analysis by Kdman degradation showed that 
the two top bands corresponded to similar but non- 
idcntlcal o-typc subunits and the two bottom bands to 
^-iype subunits. Only one each of the <k- and 0'iypc 
subunits were encoded in the alrea<ly known operon of 
proteasomc genes, indicating the presence of a hitherto 
unknown second opertjn, which was identified by gene 
hybndizadon (Figure 3). The two operons are very 
similar (about 80% sequence identity on the protein 
level) and have obviously diverged much more recently 
that the operons ot Rkodococcus and Mycobacterium^ 
but have a different GC content A screen of other 
R. eryiliivpolis stn>ins yielded in alt cases only one 
operon corresponding to the second operon of strain 
NI86/21 (T. Nagy and R. De Mot, unpublished). This 
operon has a GC content of 63% while the first oper- 
on of strain NI86/2I has a significandy higher content 
(67%), indicaung that it was acquired by horizontal 
transfer from a related aclinomycete, rather than ori- 
ginadng by gene duplication. The occurrence of only 
one operon of proteasome genes in most R. eryfhmpol- 
is stums indicates that four subunits arc the exception 
rather than the rule in eubncterial proteasomes. 
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Qaatemary structure \ 

At the resolution of electron micrographs, the pn>tea- + 
some from Rhodococcus was indistinguishable from | 
archaebacterial and cukaryolic proteasomes 128 J . Neg- | 
atively stained preparations showed the two character- t 
isuc views: ring-shaped end-on views of about 1 1 nm I 
diameter and rectangular side-views of 15 nm length. i 
Because it is notoriously difficult to analyze Uic sym- | 
racuy of pardcles formed by several rings stacked out | 
of register (especially as even small tilts of the cylinder | 
axis cause severe distortions in the projection images), j* 
it was not possible to determine from image analysis | 
of end-on views whether the rings are formed by six » 
or seven subunits. Nevertheless, the similarity of the | 
side-views to tfiose of proteasomes from other organ- i- 
isms indicates that the rings are seven-membered. To | 
decide this question and to determine the position of | 
the different subunits in the complex, an expression jj 
system was fudi up in E. coli^ in which the four sub- | 
units could be expressed in all possible combinations li 
(F. Zuhl and W. Baumcistcr, unpublished). ji 
All four combinations of one n and one fl sub- li 
unit yielded fully assembled and proteolyiically act- | 
ivc complexes, which arc indistinguishable from each | 
o(hcr and from native proteasomes in electron micro- t 
graphs (Figure 4). Obviously the subunits luive not | 
diverged to an extent that would prevent productive |i 
inieracrion. In all four cases as well as for the pixuca- p 
some isolated from Rhodococcus. native gel electro- | 
phoresis yielded sharp single bands with an apparent | 
molecular mass corresponding to complexes formed ji 
by scven-mcmbcrcd rings. This result strongly favors j: 
a (pseudo)sevenfold symmetry for the Rhodacoccux \ 
proteasome. Mixtures of two rccombinant proteasomes i 
could be clearly separated by native gel clcctrophorcs- | 
is, excluding The possibility* dial die Rhodococcus pro- f 
leasomc is in fact a mixture of two distinct com- l 
plexcs. This is supported by measurements of the K,„ , ; 
which show the constants of the four recombinant pro- 
teasomes to be very different (and much lower) than ^ 
the constant of the native complex (Figure 4). Such a 
result could not be obtained by simple mixtures of dif- ^ 
fcrent complexes and i ndtcatcs that die two ^ .subunits >: 
interact in the native complex. Two stibunit topologies 
remain possible which cannot be rcsol ved at this point- 
stacks of four different homooligomeric rings or a and ^ 
0 rings wiUi a random subunit distribution. 
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Aijtfu/tf 4. Rlcctinn micrDgniphs of native and recombuiaiU Riwdocaccus pnueu*»mes with ihe corrcspnnding averages of the side views at tiicr- 
bdiuim. Al ihe uip Jire listed rhc respective K values. 



Processing cuid assemhly 

Both 0 Kubunits of Rhod^KcKcus an synthesized in an 
inactive precarsor form and arc activated by cleavage at 
an iiUemat Ihrcontne residue during maturation of the 
protcasome (28]. When the 0 sijbunits are expressed 
without a subunits they remain unprocessed (and there- 
fore inactive), w> expected from experiments with the 
Thennoplastfia pcoreasonte. The fact that the 0 sub- 
units ane processed conecdy in all four tecombinant 
protcasomes described in the previous section shows 
that in Rkodoc<Kcus^ both a subimils can support the 
processing of both subunits. 

Unlike Therntophsma, where the a subunits 
assemble spontaneously into sevcn-metnbercd rings 
129J, the a subunits of Rhodococcus are not cap- 
able of self-assembly eidter when expressed separ- 
ately or jointly (F. Ziihl and W. Baumeister. unpub> 
lishcd). Preliminary results also indicate that unlike 
in Thermaphsma but like in eukaryotic proteasoraes 
141, 42 J, the assembly proceeds via inrcmiediate 'half- 
proteasomes*. This points to significant dilTercnccs 



between the assembly pathways in 'JTitrmctpLisina and 
Rh<uIocacc(ts, 

Is the Rhodococcus pniteasonu: vf eukaryotic origin? 

Prolcasomcs have so far been detected in several actin- 
omycctes but in no other eubaclerla, and the genome 
sequences of Haemophilus inftuettzoc [43] and Mycv- 
piasma genizalium [44] show them to be absent from 
these ofganisins. It Iherefoiv seems conceivable that 
the actinomycetes have acquired the proteasome genes 
by horizontal gene transfer after their separation frum 
other Cram-posiiivc bacteria. Features of the assembly 
pathway, the sensitivity to inhibitors and titc lengtli of 
the P-typc subunit prosequences all point to u great- 
er similarity between the eubacterial and eukaryotic 
proteasomes, indicating that diis transfer may have 
occurred from a eukaryotic organism. 

In eukaryotes, the 20S proteasome interacts with 
die 19S caps, which contain a set of ATPa.ses of die 
AAA family [8, 45-481, fntriguingly, an ATPase of 
the AAA family is found in close proximity to the 
proteasome genes in RhtHiococcus and Mycobacieri- 
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um. Although its .sequence is too divei^eni to ailow 
its assignment t(i the 19S group of ATPases, it con- 
tains several features indicating that il in fact belongs 
to diis group: It has the sarnc domain structure con- 
sisting of an N-(jcnnina) coiled coil, followed by a 
region of about 60 residues, and ending with die AAA 
ATPase domain, which extends to the C-tcrminus of 
the protein. Within AAA ATPases, N-terminal coiled- 
coil segments have so far only beeji found in the I9S 
group [12]. Also* both the RhoeUjCfKcus and the Myco- 
baaerium ATPases contain a tyrosine as the one-but- 
iast residue, a feature that appears to be character- 
istic for the 19S and CDC48/p97 groups of AAA 
ATPcises. Tlie ATPase fnini Rhodococcus has been 
cloned, sequenced, and expressed in E, coli (S. Wolf, 
R. T>e Moc, and W. BaumeisicL unpublished). It has a 
low, N-cthy I malcimidc-sensitivc ATPase activity typ- 
ical of AAA ATPases and forms a cylindrical complex 
of two six-inembcrcd rings. This complex forms :ind is 
stable in the absence of ATP. An interaction between 
this ATPase and the Rhodococcus proteasoine is being 
explored but has not yet been demonstrated. 



The HslV protease 

Although most eubacleria do not appear to contain 
a 20S pioteasome, they contain a related protease 
complex of simpler quaternary structure. This com- 
plex, HsIV, is formed by only two stacked rings of 
/?-typc subiinits, with an overall height of 8 nm and 
a diameter of 1 1 nm (M, Rotirwild, A. Goldberg, 
and W. BaunKister, unpublished). Tt essentially cor- 
responds to die two central rings of the proteasome; 
however, HsIV is built of six-membered rings. 

The HsIV protease occurs in an operon with an 
ATPase of the CIp family, HsIU (38J. This arrange- 
ment of a protease with an ATPase in one operon par- 
allels the operon structure of die ClpXP protease (49 J, 
which is al.*io heat-shock inducible, except that here, 
a Clp-type ATPa.se is paired with a proteasome-type 
protease. Bodi HslU and HsIV have been purified to 
apparent homogeneity 136, 501 The HslU subunits 
can associate into six- and seven-memt)ered rings in 
the presence of ATP or non-hydrolyzable analogues. 
Tlie HslU rings can then bind to both ends of the HsIV 
core protease. This association stimulates the proteo- 
lytic activity of HsIV by two orders of magnitude in 
die presence of ATP but not^in the presence of non- 
hydrolyzable analogues. 



h\ iheir general architecture, the HslVU and CIp 
protease complexes resemble the 26S proteasome: the 
proteolytic activity is confined to ttie innermost cav- 
ity of a molecular microcoinpartmcnl. well segregated 
from the cellular environment and guarded by ATPase 
rings which select and unfold die proteins taigeted for 
degrjtdarion. It dierefore seems likely that correspond- 
ing ATPase complexes will a1sf> be shown to exist for 
the Rhodococcus20$ proteasome. 
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